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Abstract. Discrete temporal transitions occur in a variety of domains, 
but this work is mainly motivated by applications in molecular biology: 
explaining and analyzing observed transcriptome and proteome time se- 
ries by literature and database knowledge. The starting point of a formal 
concept analysis model is presented. The objects of a formal context are 
states of the interesting entities, and the attributes are the variable prop- 
erties defining the current state (e.g. observed presence or absence of pro- 
teins). Temporal transitions assign a relation to the objects, defined by 
deterministic or non-deterministic transition rules between sets of pre- 
and postconditions. This relation can be generalized to its transitive clo- 
sure, i.e. states are related if one results from the other by a transition 
sequence of arbitrary length. The focus of the work is the adaptation 
of the attribute exploration algorithm to such a relational context, so 
that questions concerning temporal dependencies can be asked during 
the exploration process and be answered from the computed stem base. 
Results are given for the abstract example of a game and a small gene 
regulatory network relevant to a biomedical question. 



1 Introduction 

Discrete temporal transitions occur in a variety of domains: control of engineering 
processes or roboters, flow of computer programs, a piece of music, games, etc. 
We are mainly interested in biological applications, but we develop a formal 
structure as widely usable as possible. 

The practical aim is to explain experimental time series in molecular biology 
or to hypothesize about temporal developments, especially in the context of gene 
expression regulation. Its first step is transcription, i.e. the synthesis of mRNA 
from a DNA sequence coding for a gene. Concentrations of mRNA for all genes 
of a cell culture (transcriptome analysis) can be measured by the rather new 
technique of microarrays (RNA binds to matching fragments of DNA or RNA 
fixed on a chip). The second step of gene expression is the translation of the 
mRNA into multiple identical proteins by ribosomes. Since the mRNA concen- 
trations are only weakly correlated to the respective protein concentrations, it 
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is recommended to also measure the latter, i.e. to perform proteome analysis. 
However, it is unfavourable that weakly expressed proteins remain undetectable. 
By complex - activating or inactivating - interactions of proteins within or be- 
tween cells (signaling pathways), a special class of proteins can be activated and 
- if necessary - transported to the cell nucleus. Those transcription factors again 
regulate the expression of a sometimes large set of genes. 

At a more global level, such cycles are described as gene regulatory networks 
(Figure[T]). One abstracts from biochemical activation processes of proteins; only 
the mRNA or protein level is considered as the main influencing factor. The 
indirect interactions between genes are positive (upregulation of expression) or 
negative (downregulation) . Regulatory networks may be constructed based on 
knowledge available by manual or automatic (text mining) literature search and 
in biological databases. 




Fig. 1. Gene regulatory network. — > upregulation, H downregulation. The 
information was obtained from the text mining software PathwayStudio 
[www.ariadnegenomics.com] and the manually curated protein interaction database 
Transpath [www.biobase.de]. 



The network determines the possible transitions between properties of gene 
products (mRNA or protein levels) ; as a first approximation they can be either 
present or absent. In the following we translate similar situations into the lan- 
guage of formal concept analysis (FCA), so that attribute exploration 3, 85ff.] 
can be applied. During this interactive algorithm, an expert is asked about the 
general validity of basic implications A — > B between the attributes of a given 
formal context (G,M,I). An implication has the meaning: "If an object g 6 G 
has all attributes a G A C M, then it has also all attributes b G B C A/." If 
the expert denies, he must provide a counterexample, i.e. a new object of the 
context. If he accepts, the implication is added to the stem base (Duquenne- 
Guigues base) of the context. At the end, all implications valid in the possibly 
enlarged context can be derived from the minimal set of rules contained in the 
stem base. Those are identical to the implications valid in the explored domain 
according to the knowledge available to the expert. 
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The present work is based on a FC A modeling of temporal transitions in [4] . 
The biological application is influenced by computation tree logic l a , Boolean 
networks [5] and qualitative reasoning [6 . Temporal concept analysis as devel- 
oped by K.E. Wolff [7] is more directed toward a description of temporal concepts 
than toward temporal logic. In future work, we shall investigate existing analo- 
gies and take advantage of them. 

2 Methods - Basic Definitions 

We start with two sets: 

— The universe E. The elements of E will be called entities. They represent 
the objects of the world which we are interested in. 

— The set F (fluents) denotes changing properties of the entities. 

A state of the universe is characterized by a unique value in F taken by every 
e G E; states with the same attribute values are identified!]] Therefore a state 
can be defined as a map <p : E — > F. If the state is not completely known, ip is a 
partial map. To explore static features of states, the following formal context is 
defined as a special case of a many- valued context [31 36ff.]. An example of an 
attribute exploration of a state context (defined as a single- valued context and 
with a slightly more general notion of a state) is given in [H 4.1.]. 

Definition 1. Given two sets E (entities) and F (fluents), a state context is 

a many-valued context (G, E, F, J) with G C {p : E — > F}; its relation J is 
given as (tp, e, /) G J p>(e) — f, for all ip G G,e G E and f G F. 

The class of these contexts is well defined; since ip is a map, the property of a 
many-valued context is fulfilled: (tp,e,fi) G J A (ip, e,/2) G J =^ fx = $2- If a 
many-valued attribute is regarded as a partial map from G into F, one can also 
write e(ip) = f. 

For each attribute e G E, a scale can be defined, i.e. a one-valued con- 
text S e := (G e ,M e , J e ) with e(G) C G e . Thus by plain scaling we derive from 
(G, E, F, J) the context (G, M, I) with 



If Ve G E : M e C F, we get M C E x F . This is the case e.g. for nominal, ordinal 
and dichotomic scales. For nominal and dichotomic scales, the relation / simply 
is defined by (pi (e, f) :<^4> <p{e) = /; the following text is based on this relation. 

1 They can of course be differentiated by introducing a new attribute, e.g. "time 
interval" . 

The definition of a relational context ((G, R), M, I) developed below corresponds to 
a labeled transition system with attributes, in the sense of [U Definition 1]. It has 
a single action "update" or "switch" and is trivially attribute defined 4 t Definition 




(1) 



eS-E 



ipl(e,m e ) :<^> e((p) = f and fl e m e . 



(2) 



2]- 
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Now we need a supplementary structure: a relation JJCGxG indicates 
temporal transitions between the states. A deterministic relation may be given 
by a family of elementary transition rules: preconditions / postconditions 
(V k ,N k ) keK , V k ,N k <Z M, so that 

(cp , ipi) G R & Vk G K : V k C <p' => N k C (3) 

In the non-deterministic case (e.g. for a game), different postconditions are 
possible. There is a class of families {(Vk, Nl) ke K\ I G L k for all k G if}, and 

VA; G K : V k C ^ => 3Z G L fe : N l k C ^ (4) 

The relational context ((G, R), M, I) can be represented by a binary power 
context family. Here we prefer the equivalent context, analoguous to [4] Definition 
4]: 

Definition 2. Given a state context (G, E, F, J) and a relation R C G x G, a 
transition context K, is the context (R,M x {0,1},/), M C E x F, with the 
property 

We {0,1}: (<po,<pi)i(e,f,i)&tpi(e) = f. (5) 

It appears promising to consider the transitive closure t(R) = {J n£ jqR n , i.e. 
<^o ^(-R) <£>i for any elements <po and of G, provided there exist ao, cti, a n G 
G with «o = VO) a n = fx, and ov-Rav+i for all < r < n. That means, the state 
ifl emerges from ipo by some transition sequence of arbitrary length. So we get 
a new transitive context 

Kt ■■= ((G,t(R)),M,I) = (t(R),Mx{0,l},I). (6) 

The relation I is defined like I in ((3J). 

Regarding this context, queries like the following are possible, for A, B,C C 
M, m G M (compare [U 37], [H 2020f.]). In a non-deterministic setting, the 
implications Q and © refer to all possible transition paths starting from a 
state ipo with all attributes b G -B. According to computation tree logic [I] 33], 
one could also ask if a path exists with the respective property. |(5J) expresses 
that in the future development of ipo, there will be a state with attribute m for 
at least one path. 

B -> never(m) (B x {0})' n (m, 1)' = (7) 

B -> cventually(m) ^ (B x {0})' n (m, 1)' ^ (8) 

B -» always(m) ^ (B x {0})' C (m, 1)' (9) 

3 stable state or oscillation « 3Bc M : (B x {0})' n(Bx {1})' ^ (10) 

Given a (partial) initial state A, can the system 
reach the state C while passing by another state Bl (11) 
«(Ax {0})' n(Bx {1})' ^ A (B x {0})' n (G x {1})' ^ 



5 



Those queries can also be checked for contexts modified by omitting some transi- 
tion rules. So one can investigate, if certain interactions are necessary for specific 
state transitions. 

The attribute exploration process has to be adapted, so that similar questions 
can be asked as implications during the exploration and be answered from the 
computed stem base. The following equivalences are straightforward: 

B -> nevcr(m) «Bx{0}U (to, 1) _L (12) 
B -> always(m) « Bx {0} — > (to, 1) (13) 

A counterexample has to be introduced into the context, if the temporal property 
in question is in contradiction to the data or to the desired behaviour of the 
system which is to be designed. 



3 Results - Two Examples 

In this section a state transition (<po,<Pi) is written as (<p m , f out ), and attributes 
arc noted as m m or m out instead of (m, 0) or (to, 1). 



3.1 3-pawns-chess 



In order to get a widely applicable view on discrete state transitions, the abstract 
case of a simple game is introduced. It resembles chess with only three pawns. 
The game is won when a pawn reaches the opposite side or when the opponent 
is blocked from further moves. Below are listed all states reachable from a state 
</?q™ (0. - two moves after the beginning), and the bar marks the next player. The 
following transitions are possible: 

(ip™, P5 Ut )> ifTi'Pe 1 *) (similar transitions are not listed); 

fair. ¥>?"*)■ 

In states 4, 5 and 7, black wins, in 6 white. 



0. 
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Our basic sets are E — ({a,b,c} x {1,2,3}) U {move, win}, F — {white, 
black}. G, the set of all possible states of the game, is a proper subset of {(f : 
E — > F}. Some examples of the attributes are al .white, move. white or win. black. 
The state context (G, E, F, J) is not complete, because in every situation there 
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are at least 3 empty fields, and not every state is a win-situation; there exist 
e G E, so that the domain D of the corresponding map e : D C G — > F is not 
equal to G. 

Starting from the context with the transitive relation for the states 0. to 7., 
the stem base was computed H Among others, the following of the 61 implications 
are of some interest (T denotes the empty set of preconditions, _L := M provided 
Af = 0): 

— T — » a3. black 171 , c3. black™, a3. black""*: a3 is always occupied by black, c3 
always but in the last step. 

— b2. black "* — * _L: b2. black "* characterizes an impossible game situation. 

— a2. white "*, move. white "* — ► c2. black "*, win. black "*: For white, this im- 
plication could be a warning not to move to a2. 

— a2. black "*, move. white "* — ► win. white "*: This confirms the tactic impor- 
tance of a2. 

— c3. black "*, move. white "* — > al. black "*, win. black "*: another winning con- 
dition. 

3.2 Gene regulatory networks 

We want to provide a temporal semantics for gene regulatory events, e.g. "genel 
upregulates the expression of gene2" . So the entities E are the interesting genes, 
and the fluents F = {abs, pres} = {-,+} are mRNA or protein levels. 

In this section, the biological application of the present approach is explained 
by the example of the 5 gene network of Figure[TJ We confine ourselves to a single 
measured time series of mRNA concentrations. It is part of ongoing biomedical 
research directed toward the understanding of complex molecular interactions 
relevant for the pathogenensis and therapy of rheumatoid arthritis (RA) . This 
disease putatively has autoimmune causes, and it is recognized that proteins 
like Tnfa and 111/3 - responsible for intercellular communication - have a ma- 
jor stimulating influence on the inflammatory process [2]. Therefore fibroblasts 
(particular cells of the joint) from RA patients were stimulated with Tnfa, and 
their expression was monitored by Affymetrix U133 Plus 2.0 microarrays before 
and 1, 2, 4 and 12 hours after stimulation. mRNA levels were grouped into the 
two classes absent and presently One resulting time course is shown in Table [1] 
as a transition context JC obs according to Definition [2] 

Now a corresponding knowledge based context will be developed. State tran- 
sitions are computed according to ((3]): all rules of one family are applied with 
preconditions matching the attributes of the input state (p m . The type of rules 
valid for particular genes is determined by the regulatory network (Figure [T]) . 
Table [2] lists some basic rule types; they are sufficient to compute the 2-gene 
transition context of Table [3l 

2 This is equivalent to an attribute exploration, where the expert accepts all implica- 
tions. 

3 For larger examples and datasets, a formal method will be selected, like the 
present /absent call of the gene expression chip, cluster analysis or minimization 
of intra group variance. 
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Table 1. Observed transition context K, obs . 
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Table 2. Transition rules (simplified notation). 



Nr. Meaning Rule 

1 Upregulation genel.pres — » gene2.pres 

2a Downregulation genel.pres, gene2.pres — > gene3.abs 

2b Failed downregulation genel.pres, gene2.pres — > gene3.pres 

2c No downregulation genel.pres, gene2.abs — > gene3.pres 

3 Degradation gene.pres — > gene.abs 

4 No effect gene.abs — > gene.abs 



Rules 3 and 4 are default rules; they are only applied to genes not occurring 
at the right side of another rule. Since the model abstracts from exact thresholds 
and time delays (which are rarely known), there are the alternative downregu- 
lation rules 2a and 2b. After one time step, upregulation or downregulation can 
prevail. (By the same reason, one could add to rule 3 the alternative gene. present 
— > gene. present.) The model is non-deterministic, the context K, of Table [3] shows 
the possible state transitions, starting from the initial state (p™ of the individual 
time series K, obs . It could also be relevant to investigate contexts containing the 
initial states of different observed cellular conditions, different patients or with 
all possible input states. 

Table 3. Knowledge based transition context K, for 2 genes. Example rules for 
(<pi n ,<P2 V *)- Tnfa.pres, Tnfaip3.pres -> Tnfa.abs (2a); Tnfa.pres, Tnfaip3.pres -> Tn- 
faip3.pres (2b). 



Transition Tnfa 1 ™ Tnfaip3 ln Tnfa out Tnfaip3°"* Applied rules 
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Implications of this context /C simply reflect the rules applied in order to 
compute a state transition. Deterministic transition rules even may be included 
in the stem base of the context, or they follow from it. (Of course, the stem base 
contains also implications in the inverse direction - from output to input at- 
tributes - or mixed implications like genel.pres out , gene2.abs m — > gene3.pres m .) 

The transitive context /Q is derived from K. by adding all supplementary ob- 
jects (tp m , (p out ) S t(R). An interactive attribute exploration of K. may be more 
intuitive than an exploration of ICt', the expert can compare the implications 
in question to the measured one step transitions of IC obs and eventually check 
them against supplementary knowledge. However, a time step of a knowledge 
based transition is not identical to a measurement interval; the problem is ag- 
gravated, if the intervals are different as in the present case. Therefore it seems 
more appropriate to explore the transitive context /Q immediately. Its implica- 
tions denote dependencies between attributes of states related by transitions of 
arbitrary duration. The following procedure was applied: 

1. Transform a time series of gene expression measurements to an observed 
context K, obs . 

2. For a set of interesting genes, extract transition rules from biological litera- 
ture and databases. 

3. Construct the transition context IC, starting from ip™ of IC obs . 

4. Derive the respective transitive contexts ICt and JC° bs . 

5. Perform attribute exploration of /Q. Decide about an implication A — > B by 
checking its validity in tC° bs and/or by searching for supplementary knowl- 
edge. Possibly provide a counterexample from K,° bs . 

6. Answer queries from the modified context IC t and from its stem base. 

For all 5 genes Tnfa, Tnfaip3, Icaml, Ccl4 and 111/3, a more complex set of 
transition rules had to be defined, which we shall not discuss here. 

In step [SJ automatic decision criteria could be tresholds of support q = 
\(A U B)'\ and confidence p = for an implication in JC° bs . A weak crite- 

rion is to reject only implications with support (but if no object in IC° bs has 
all attributes from A, the implication is not violated). In the present example 
a strong criterion was applied: implications of ICt had to be valid also in the 
observed context. This is equivalent to an exploration of the union of the two 
contexts. Its results, where all common implications were accepted by the ex- 
pert, are presented in Tabled] It has to be considered that the combined context 
generally only represents a transitive relation on the states for its subcontexts ICt 
and IC° bs . The main purpose of the proposed exploration is to make a falsification 
of the ICt implications possible. 

The subsequent implications are noteworthy and biologically meaningful: 1. 
is equivalent to always(Icaml.pres°"*). The same assertion for Ccl4 was falsified 
by the measurement; instead there are the new implications 2. to 5. and 15. to 
17. The static implications 6. and 10. to 13. reflect the very similar regulation of 
111/3, Icaml and Ccl4 (e.g. by 111/3 and Tnfa) and were also valid in the observed 
context. Likewise, 14. was supported by a priori and observed transitions. 7. to 
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9. and 19. to 21. mirror the important role of the upregulating genes 111/3 and 
Tnta: if 111/3, Tnfaip3 or Tnfa are upregulated at an arbitrary time point, either 
111/3 or Tnfa have been present in the past. 

Table 4. The stem base of the combined knowledge based and observed transitive 
contexts. Implications following from the previously entered background implications 
of the form gene.abs, gene.pres — > _L are not shown. The implications are presented in 
a short form proposed during attribute exploration by the Conlmp program (available 
at http://www.mathematik.tu-darmstadt.de/~burmeister/ConImp.tar), with basis of 
premise and/or reduced conclusion. 



Nr. Implication 



1. 


T 


— ► Icaml.pres out 


2. 


Illb.abs out 


-> Ccl4.pres 011 ' 


3. 


Ccl4.abs ollt 


-> Tnfa.pres*™ Tnfa.abs 011 ' Tnfaip.pr 011 * Ill/3.pres 011 * 


4. 


Tnfaip.abs 011 * 


-> Ccl4.pres out 


5. 


Tnfa.pres 011 ' 


-> Ccl4.pres ollt 


6. 


Ill/3.pres in 


— > Icaml.pres 1 ™ CcRpres 1 ™ 


7. 


Ill/3.abs in Ill/3.pres 011 * 


Tnfa.pres*™ 


8. 


Ill/3.abs in Tnfaip.pres 011 * 


— ► Tnfa.pres*™ 


9. 


Ill/3.abs in Tnfa.pres "' 


— ► Tnfa.pres*™ 


10. 


Ccl4.pres in 


— > Icaml.pres 1 ™ 


11. 


Ccl4.abs in 


— ► Tnfa.pres*™ Tnfaip.abs" 1 Icaml.abs 1 ™ Ill/labs 1 ™ 


12. 


Icaml.pres 1 ™ 


— ► CcRpres*™ 


13. 


Icaml.abs 1 ™ 


-> Ccl4.abs™ 


14. 


Tnfaip.pres 1 " 


— ► Icaml.pres 1 ™ 


15. 


Tnfaip.abs 1 ™ Icaml.pres ln 


— ► CcRpres 011 * 


16. 


Icaml.pres™ Ccl4.abs°"* 


-> Tnfaip.pres 1 " 


17. 


Tnfaip.abs 1 ™ Ccl4.abs ou * 


— > Icaml.abs 1 ™ 


18. 


Tnfa.abs*™ 


-» Icaml.pres*™ Ccl4.pres ollt 


19. 


Tnfa.abs*™ Ill/?.pres 011 ' 


— ► Ill/3.pres*™ 


20. 


Tnfa.abs*™ Tnfaip.pres "* 


— ► Ill/?.pres*™ 


21. 


Tnfa.abs*™ Tnfa.pres ollt 


-> Ill/3.pres*™ 


22. 


Tnfa.abs*™ Ill/3.abs*™ 


-> Tnfa.abs 011 * Tnfaip.abs 011 * Ill/3.abs ollt 



By reasoning over the stem base, hypotheses and predictions as results of 
similar implicational queries can be made, concerning transcriptome time series 
under equivalent experimental conditions to those of JC obs . A query B — > eventu- 
ally^) ([8]) is decided positively for an existing transition path, if B — * never(m) 
(p~2|) does not follow from the stem base. Set operations in the resulting context 
provide answers to further types of queries. It can be asked, whether a set of 
genes is in a stable state or shows an oscillatory behaviour (fT0|) . Answers to 
queries such as (JTTJ) can explain an observed 3-point time series. 

Altogether experimental data can be better understood, and reciprocally 
those are used for a validation of the implicational knowledge base during the 
exploration process. 
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4 Outlook 

A mathematically very interesting task will be the investigation of a new state 
context; its objects are states tp, and the attributes are more abstract temporal 
properties like eventually (Ccl4.pres) or oscillation^). We want to develop a 
set of background implications, so that implications of the new context can be 
derived from those of the transitive context. Also the dependency of a transitive 
from an underlying transition context will be investigated. A continuous task is 
to collect further meaningful biological questions that can be answered by our 
approach, and to develop a biologically more exact, comprehensive and realistic 
model. Thus it is planned to introduce finer steps than present /absent and to 
adapt the transition rules to this approach. Also a more precise definition of 
time intervals could be useful. Formal concept analysis is a mathematically and 
logically strict and rich theory, and we will further investigate its explanatory 
potential for temporal transitions. 
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